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Abstract 

In this paper, we consider a particular class of selective fading channel corresponding to a 
channel that is selective either in time or in frequency. For this class of channel, we propose a 
systematic way to achieve the optimal DMT derived in Coronel and Bolcskei, IEEE ISIT, 2007 
by extending the non-vanishing determinant (NVD) criterion to the selective channel case. A new 
code construction based on split NVD parallel codes is then proposed to satisfy the NVD parallel 
criterion. This result is of significant interest not only in its own right, but also because it settles a 
long-standing debate in the literature related to the optimal DMT of selective fading channels. 

Index Terms 

Diversity multiplexing tradeoff, selective channel, code construction, cyclic division algebra, 
non vanishing determinant (NVD) code. 

I. INTRODUCTION AND MOTIVATIONS 

In this paper, we consider the selective fading MIMO channel where a transmitter having 
n t antennas wants to communicate with a receiver having n r antennas. We assume that the 
communication occurs on a channel that exhibits memory either in time or in frequency. Our 
objective here is to construct reliable coding scheme for the high data rate communication in 

Part of this work was performed while the first author was with the Communication Theory Goup in ETH Zurich. Lina 
Mroueh is now at the department of signal, image and telecommunications in the Institut Superieur d'Electronique de Paris 
ISEP, France and Jean-Claude Belfiore is with the department of communication and electronics in Telecom ParisTech, 
France. This paper was presented in part at the IEEE Information Theory Workshop (ITW) in Sept. 2010, Dublin, Ireland. 
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the high SNR regime when the channel is not known at the transmitter side. The performance 
criteria to evaluate the coding scheme that will be used in the following is the well-known 
diversity multiplexing tradeoff (DMT). 

The diversity multiplexing tradeoff (DMT) proposed by Zheng and Tse in [0Q is a powerful 
approach to characterizing the dual benefits in terms of diversity and spatial multiplexing in 
the high SNR regime. In order to achieve the optimal diversity multiplexing tradeoff for the 
flat fading MIMO channel, Belfiore et al. introduced the non-vanishing determinant criterion 
in 0. Later, Elia et al. [|3l proved that this criterion is a sufficient condition to achieve the 
optimal DMT using a full rate code. 

While most of the above results address the case of flat fading channels, the general channel 
model of time-frequency selective channels has been considered by Coronel and Bolcskei 
in BH, [51 where the optimal DMT is derived. Moreover, a DMT optimal coding scheme 
based on a joint precoder and parallel codes construction, is proposed. As the block fading 
channel is a special case of the time-frequency selective channel, it is expected that the DMT 
expression in matches with the corresponding result in (T). This is, however, not the 
casej and has given rise to lots of debate in the literature e.g. 10. A rigorous interpretation 
of this incoherence in results remains an open problem. The present paper settles the issue 
and shows that the DMT derived in (5]| is, indeed, achievable. 

Contributions: We consider a particular class of the general channel model considered 
in 01, where the channel is selective either in time or in frequency. For this class of 
channels, we propose a systematic way to achieve the optimal DMT by extending the non- 
vanishing determinant criterion to the selective channel case. A new code construction based 
on split NVD parallel codes is then proposed to satisfy the NVD parallel criterion. Moreover, 
for the block fading channel, we provide an extension of the geometrical interpretation to 
show the achievability of the optimal DMT. This result is of significant interest not only 
in its own right, but also as it shows that the optimal DMT in [5| is achievable for all the 
classes of fading channels including the block fading channel. 

Outline of the paper: The rest of the paper is organized as follows. In Section flU we 
define the selective fading channel model. We review in Section [HI] some basic preliminaries 
and background materials that are essential to the development of this paper. Then, we derive 
in Section [IV] the limiting outage bound on the achievable DMT. We derive in Section [V] 

'The optimal DMT expression in is larger than the one in llD . 
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the code design criterion required to achieve this optimal DMT for this class of selective 
channels and propose a new family of split NVD parallel codes to satisfy this code design 
criterion. Finally, Section [VTIl concludes the paper. 

Notation: The notation used in this paper is as follows. Boldface lower case letters v denote 
vectors, boldface capital letters M denote matrices. denotes conjugate transposition. 
denote the transposition operator. ||H||| = Tr{HH^} is the Frobenius norm of a matrix. 
Tr{A} refers to the trace of matrix A. Ijv stands for the NxN identity matrix. diagjAnj-^TQ 1 
denotes the block diagonal matrix containing A n on its diagonal. vecA = [a^ . . . a^]t T l, 
and aj is a column vector of matrix A. The non zero eigenvalues of A ordered in ascending 
order are denoted by Aj(A). CM represents the complex Gaussian random variable. is the 
mathematical expectation w.r.t. to the random variable X. Equality in distribution between 
two random variables X and Y is represented by X ~ Y. Exponential equality is denoted by 
fix) = x b , i.e. lim^; .5.00 '"f^ = b, and >, < denote the exponential inequality. 1*4.1 denotes 

log X 

the cardinality of a set A. Finally, A <g> B denotes the Kronecker product of the matrices A 
and B. 



We consider the general case of selective fading channel which includes the case of time 
and frequency selective channel. In order to deal with such type of channels, techniques that 
decompose these channel into parallel sub-channels are generally used in literature Q. The 
input-output relation for the class of channels considered in this paper is therefore given by 



where n = 0, 1, . . . , N — 1 represents the sub-channel n, the sub-channel H, t r is a 
n t x n r MIMO channel that remains constant during all the duration of the transmission T, 
X n represents the transmitted signal, and Z n denotes the additive i.i.d. CJ\f(0, 1) noise. The 
channels H n are correlated across the sub-channels n = . . . N — 1 according to, 



where Re is the NxN correlation between the scalar sub-channels characterized by its 
rank equal to p < N, H w is an n r x Nn t matrix with i.i.d. CM(0, 1) entries. The transmitted 
signal satisfies the following power constraint, 



II. Channel and signal model 




(1) 



H = [H . . . Hjv-i] = H w (Rjj/ 2 ® I nt ), 



(2) 




(3) 



i=0 
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Throughout this paper, we set m = min(n t ,n r ) and M = ma.x(n t ,n r ). 

The input-output relation considered in (Q~|) models the case when the channel is selective 
either in time or in frequency. For the frequency selective channel, the MIMO OFDM system 
decomposes the channel into N parallel subcarrier, where N represents the total number of 
subcarriers and n stands for the frequency. The sub-channel remains constant over each 
subcarrier and the correlation matrix Re is a circulant matrix having a rank equal to L 
which is nothing but the number of channel taps or the memory of the selective channel. 

For the time selective case (or the block fading channel), the channel remains constant 
during a block n of T time slots and changes in a statistically independent manner across 
blocks. For this case, N represents the total number of blocks and Re = Ijv with full rank N. 

III. Preliminaries and background 

In this section, we start by recalling some basic preliminaries on the optimal diversity 
multiplexing tradeoff (DMT) of the code in Subsection IIII-AI and on the limiting outage 
bound of the selective fading channel in Subsection IIII-BI Then, we briefly review prior 
results from literature that motivate our contribution. 



A. Diversity multiplexing tradeoff (DMT) 

Let Afp(SNR) be a family of coding schemes operating at a given SNR, and let i?(SNR) 
denote the rate transmitted per sub-channel, such that, 

i?(SNR) = r log SNR, 

where r is the multiplexing gain per sub-channel. 

The diversity multiplexing tradeoff (DMT) of the coding scheme A'p(SNR) is defined as 
the SNR exponent of the error probability P e ,x p ( r ; SNR) using maximum likelihood-decoding 
such that 

log P e ,x p {r, SNR) 

a[r) = — hm . 

V ' SNR->oo log SNR 

For a given multiplexing gain r, the optimal DMT is the largest DMT supported by any 

coding scheme, and is is denoted by d*[r). 
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B. DMT outage bound 

The outage probability of a selective fading channel when the target rate R scales as 
r log SNR is defined as, 

Pout (r) =P{ log det (l N +-^-HH^ < Nr log SNR j , 

where H. = diagjHnl^rQ 1 is the block diagonal channel matrix. 

The optimal DMT of the selective fading MIMO channel has been derived in H and 0. 
For this general case, Coronel and Bolcskei showed that the outage probability is bounded 
as. 

P e ,x p (r) > Pout(r) > Pj(r) = SNR"^ (4) 

where, 

dj{r) = (pM — r)(m — r). (5) 

and m = mm(n t ,n r ) and M = max(n t ,n r ). Note that the first inequality in © is a 
consequence of the Fano inequality HI, and the second inequality is a consequence of the 
Jensen inequality as shown in |@]|. Moreover, a coding scheme that achieves the bound called 
"Jensen bound" in the terminology of [5] has been proposed in [|4]| and 0. It follows therefore 
from [5J that the optimal DMT is equal to, 

d(r) = dj{f) = (pM — r)(m — r). 

C. Previous work and motivations 

The block fading channel is a particular case of the selective fading channel model 
considered in (OQ) with covariance matrix R H = I N . The optimal DMT expression is therefore 
d*{r) = (NM — r)(m — r), which is the DMT expression of the general channel model 
considered in flU, applied to this particular channel setting. Obviously, this result does 
not match with the corresponding result in (T), i.e., di(r) = N(M — r)(m — r) < d*(r), Vr. 
This incoherence in results has been subject to lots of debate in literature e.g. [6] and motivates 
our contribution. The authors of (61 base their arguments on a non-accurate outage probability 
derivation (P ou y( r ) = SNR~ d!(r) ) to claim that the DMT of the block fading channel cannot 
exceed di(r) < d*(r). In order to settle this issue, we show in this paper that the DMT in 
0~1 is not a limiting outage bound as claimed in [6], and that the DMT in is achievable 
using codes derived from cyclic division algebra (CDA). 
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IV. Outage bound on the DMT of selective fading channel 

Unlike the flat fading channel, the analytical outage probability for the selective fading 
channel cannot be easily derived using the eigenvalues distribution. For the case of correlated 
parallel sub-channels, Coronel and Bolcskei in [|4|| generalize the geometrical interpretation 
in [HI to the selective fading case. For the particular case of the statistically independent 
parallel sub-channels which is the block fading channel, the analytical outage probability 
should be carefully performed to take into account the impact of coding across the blocks, 
which cannot be easily seen using the block diagonal structure of the matrix. For this, an 
equivalent expression of the outage probability is first derived. Then, we provide here an 
outage derivation based on the geometrical argument previously used for the flat fading 
channel in 0]] and for the selective fading case in Hi. 



A. Outage bound of the block fading channel 

For the block fading channel, the outage probability is, 

Pout (r) = P {log det (i+^-ft?^) < Nr log SNr}, 

where H = diagjHnj^TQ 1 is the block diagonal channel matrix. 

1 ) Equivalent outage expression: In order to generalize the geometrical interpretation in flU 
to the block fading channel, we start first by finding in Lemma \T\ an equivalent expression 
of the outage probability. 

Lemma 1: For the block fading channel, the outage probability is equivalent to, 

SNR 
h 



P out (r) = P{ log det (I + §^C H C^) < Nr log SNr}, 



(6) 



where 



C H 



H 



w,0 



H 



H 



w,N-l 



(7) 



H TOi i rl Wi 2 ■ ■ ■ H TO) o 
and H w> i, i = ... N — 1 are Gaussian matrices with i.i.d. entries. 

Before going to the rigorous proof, we note here that the main intuition behind this lemma 
is the fact that the block fading channel can be considered as a selective fading channel with 
a channel memory of N blocks. This is so far the case as the covariance matrix is equal to 
identity, which is a full rank matrix with rank equal to N. 
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Proof: To prove this lemma, we consider hy the N x 1 Gaussian vector ~ CjV(0, Ijv) 
containing the TV independent channel realisations between transmit antenna j and receive 
antenna i. It is well-known that the Gaussian vector h y is identically distributed as Fh^y 
for any unitary matrix F, i.e., hy ~ Fh^^Vz, j. 

In the following, we specify our result to the case where F is a N x N Fast Fourier 
Transform (FFT) matrix. This means that each channel realisation is identically distributed 
as, 



13 



N-1 

?= E * w 



-J27ri 



n = 0...N - 1. 



The block diagonal matrix "H is therefore identically distributed as D H , z.e., "H ~ D H , where, 



D H 



TV 



7V-1 



z=o 



7V-1 



(8) 



z=o 

with uji = e'i^r and H wi = (/i'Jji^Kn^i^nr 

Consequently, the mutual information is identically distributed as, 

/(x,y|H) ~ logdet (l+^D H D^ = 7 D (SNR). 

By using an FFT precoder and an FFT equalizer as in an OFDM system to transmit over the 
channel D H in ([8]), the matrix D H DH^ can be made unitarily equivalent to ChC^, where 

H m ,o Hw,i ■ ■ ■ H 

w.N— 1 

t±w,N-l tlw,0 ■ ■ ■ H^jv 

Ch 

H Wi i tl w> 2 ■ ■ ■ tx w ,o 
Thus, the corresponding mutual information To (SNR) can be written as, 

(SNR \ 
I+^ChCU ~ J(x,y|H). 

It follows therefore that the outage probability is such that, 

P out (r) = P { log det ( I +^ChC1j) < Nr log SNR } . 
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2) Geometrical interpretation: Following the geometrical interpretation of the flat fading 
channel in [Q]|, the typical outage event occurs when the channel matrix C H is close to the 
manifold of all matrices with rank Nr denoted by TZ Nr, such that, 

K Nr = {C H : rank{C H } = Nr}. 

By following the same reasoning as in 0]|, this requires that the d(r) components of C H 
orthogonal to TZNr to be collapsed, i.e., be on the order of SNR -1 . The probability of this 
event is P ut(?") = SNR -0 "-^. The number of these components is given by 

d(r) = NMm - dim(K Nr ), 

where dim(7^Ar r ) is the sufficient minimal number of parameters required to specify matrix 
C H with rank Nr. 

3) Dimensionality of TZnt -' We first note that due to the structure of Ch in ©, the 
number of parameters required to characterize a matrix C H in TZnt- is equal to the number of 
parameters required to specify an m x NM matrix (m = mm(n t , n r ) and M = m&x(n t , n r )) 
with rank r that contains the n t first columns if n t <n r , and the n r first rows if n r < n t as 
shown in Figures |l(a)| and |l(b)| . 



Nn t 





H w .o 


H w i 


H w N 1 



H w .n-i H w0 
H w .i H w2 



H w> o 



(a) Case 1: n r < n t 



Nn r 



H w o 

H W N 1 

H w i 



H w i 
H Wi o 



H 



w.2 



H Wj N 1 
H W .N 2 



H 



w.O 



(b) Case 2: nt < n r 



Fig. 1. It is sufficient to specify a m x NM matrix with rank r, with m — min(n t , n r ) , M = max(n t , n r ) to characterize 
a matrix Ch with rank Nr. 



Characterizing a matrix C H with rank Nr reduces therefore to the problem of characterizing 
a matrix of dimension m x NM with rank r that requires only NMr + (m — r)r, i.e, 

dim(7?.jvr) = NMr + (m — r)r, 

where MNr is the number of independent parameters needed to identify r independents 
vectors and (m — r)r parameters are needed to identify the linear dependent vectors as a 
function of the r independent vectors. It can be be easily verified here that the MNr free 
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i.i.d. Gaussian parameters that identify the r linear independent vectors generate a block 
circulant matrix with rank Nr with a probability equal to one. 

It can be deduced that the optimal DMT for the class of block fading channel is, 

d out (r) = NMm - &im(K Nr ) = {NM -r)(m-r). 

B. Comments on related work's derivation 

It turns out fron the geometrical interpretation that the outage event is reduced to the 
probability that the m x NM Jensen channel, denoted by H w in the rest of the paper, is in 
outage, which is the Jensen outage event in the Coronel and Bolcskei terminology [5]. This 
means that the outage event is reduced to, 

C(SNR) = {U w e C mxNM is in outage} 

Note that the straightforward generalization of the flat fading outage results to the block 
diagonal matrix in © as in [[TJ and (6J does not take into account the impact of the coding 
among the channel blocks in the analytical outage derivation and does not lead to an accurate 
outage probability expression. In the following, we show how this optimal DMT can be 
achieved using a code derived from cyclic division algebra (CDA). 

V. DMT ACHIEVABILITY: SPLIT NVD PARALLEL CODES FOR SELECTIVE FADING 

CHANNEL 

In this section, we propose a new family of split NVD parallel codes to achieve the optimal 
DMT of (pM — r)(m — r). We start first by deriving in Subsection IV-AI a sufficient condition 
on the code to achieve the optimal DMT for this class of channel. The new family of proposed 
codes is based on the previously known NVD parallel codes family which we will briefly 
review in Subsections IV-BI and IV-CI Finally, the code construction and the optimality of the 
split NVD parallel code is addressed in Subection IV-EI 

A. Optimal code design criterion 

Unlike the case of time-frequency selective channel in |'4j, we show here that when the 
channel is selective either in time or in frequency, there is no need to construct an additional 
precoder adapted to the channel statistics in order to achieve the optimal DMT. The optimal 
code design criterion required to achieve the optimal DMT is summarized in the following 
theorem. 
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Theorem 1 (Sufficient condition for DMT achiev ability): A coding scheme X £ <Y P (SNR) 
achieves the optimal DMT (pM — r)(m — r), if for any two different codewords X, X £ 
Afp(SNR), the eigenvalues of the block diagonal matrix DD^, where D = diag{(X n — 
Xn)}^ 1 satisfy 

in ^ 

x.x 6 fU> II A -< DDt ) * 2 «Ms«r < 9 > 

Proof: The proof of this theorem uses the same steps as the proof of [Theorem 1 in flU] 
and is detailed in Appendix [A} 



B. NVD parallel scheme 

Let X = diagjXnj^TQ 1 £ <Y P (SNR) be the block diagonal matrix containing the transmit- 
ted codeword Xj in (QQ), and constructed such that X = 9 3, where 9 is a scaling factor that 
depends on the structure of the code, and chosen to ensure the power constraint in ©. The 
block diagonal matrix 3 = diagjSj}^^ 1 is an NVD parallel code denoted by C(SNR), and 
defined as follows: 

Definition 1 (NVD parallel scheme): Let ^4(SNR) be an alphabet that is salably dense, 
such that 

Vs £ .A(SNR) \s\ 2 < |.A(SNR)|. 

Then, C(SNR) is called NVD parallel code if, 

1) Each entry of S is a linear combination of symbols carved from .A(SNR). 

2) The total number of transmitted symbols carved from .A(SNR) is equal to TNn t . 

3) For any pair of different codewords 3 and 3 £ C(SNR), the NVD property is satisfied 

det((3-3)(3-3) t ) > k>0, (10) 
with k is a constant independent of SNR. 

2 We assume here without restriction that the signal constellation is a QAM constellation, i.e, .4(SNR) = _4qam(SNR). 
This can be also extended to the case of HEX constellations. 
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C. Cyclic division algebra (CDA) code structure 

We recall here the most relevant concepts of the construction of the codeword matrix 
3 = diagjS,}^) 1 based on cyclic division algebra. We refer the reader to (8), [HI for more 
details on the NVD parallel code construction. In the following, we consider, 

- The field F as a Galois extension of degree N over Q(i), and that have r as generator, 
such that 

Gal(F/Q(z')) = {t ,..., tjv-i}. 

- The field K is a cyclic extension of degree n t over F, and that have a as generator, such 
that 

Gal(K/F) = {a ,..., a"'- 1 }. 
The code 3 is constructed by setting 3j = 7j(S), i.e., 



(11) 



Tjv-l(2) 

where S belongs to the cyclic division algebra C = (K/F, a, 7), and 7 G F chosen such that 
7, 7 2 , . . . , 7 nt_1 are not norms of an element of K. The matrix 3 is defined such that 



/ 



X £1 

7a(x„ t _i) a(x ) 



\ 7a nt " 1 (a;i) -ia nt - 1 (x 2 ) ... a^ixo) J 

where, X{ = J2f=i s i,j u ji s i,j e -4(SNR) and Uj E K. For the NVD parallel code, the 
determinant is such that, 



det(diag{3 i }il 1 ) =Ij7ifc(det(a)) 



k 

= iV F /Q (i )(det(S i )) G Z[z], 
and which is equal to zero if and only if all Xi are zeros. It follows that for 3 7^ 

|det(3)| 2 > SNR°. 

We finally recall that the NVD parallel codes preserve the mutual information as, 



vec 



'■=,in 



••• 7-jv_i(3) 



[T] 



$ S 



October 14, 2010 



DRAFT 



12 



where $ is an orthogonal matrix, such that = Ijv nt - It follows therefore that the 



mutual information between the vectorized input vectors x = vec( 



^0 ' ' ' ^N-l 



and 



1 • • • 1 N-l 



the vectorized output y = vec 

/ X INI h{ 

/(x,y|H) = logdet ( I N + WH ] 



is, 

SNR. 

N-l 



where H = diag{H n } n=0 is the block diagonal channel matrix. 

D. Choice of 9 for NVD parallel codes 

Following the same reasoning in and [6]|, the scaling factor 9 that insures the power 
constraint in © is such that, 

N-l 

9 2 j2n\mi}<TN. 

i=0 

Due the linearity of this code and to the use of unit transformation, each entry of x E S is 
such that, 

E[|x| 2 ] = E[\s\% s G ^4qam(SNR), 
2(|.4(SNR)| - 1) 



This implies that, 



N-l 



}^E[||S,|||] = TNE[\x\% 

i=0 

= 7W|.A(SNR)|. 
The scaling factor 9 that ensures the power constraint is therefore, 

9 2 = l^(SNR)!- 1 . (12) 

Using the NVD parallel criterion in (flOl) and the value of 9 2 in (fT2l) . the eigenvalues of 
the block diagonal matrix D = X — X = 9(S — 3) for any different codewords X, X, are 
such that, 



|det(H-H)| 2 



Y\ A i (DD t 

i=l 



> 



|^(SNR)|^"' - |^(SNR)| W ™'' 



Due to the power constraint in ©, these eigenvalues necessarily satisfy Aj(DD^) < 1. Then, 
the NVD parallel criterion is equivalent to, 



m ^ 

nA,(DDt) > (13) 



mm 
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It can be easily verified that the NVD parallel criteria of the NVD parallel code depends 
critically on the size of the constellation. The natural question that comes here is: What is the 
optimal size of constellation that guarantees to transmit a rate i?(SNR) over each sub-channel 
and that meets the sufficient condition of DMT achievability in ©. 

E. Split NVD parallel codes and optimality 

The NVD parallel codes as put straightforwardly by Lu in [Q and Yang et al. in [|8l are 
sub-optimal, as the DMT achieved by these codes is only p{n t — r){n T — r) < (pM — r){m—r). 
The main idea of the new split code construction is to design a coding scheme that guarantees 
to transmit a rate of i?(SNR) using a total power of SNR over each sub-channel and to satisfy 
the NVD parallel criterion in Theorem \T\ The two possible ways of splitting the data over 
the parallel channels are detailed in Subsections IV-E1I and IV-E2I 

1 ) Block diagonal NVD parallel code: The first way of splitting the data over the parallel 
channels has been previously studied in [9] and is depicted in Figure [2] 

TNn t symb NT slots 



NR 




n = N - 1 



Fig. 2. Coding across time and frequency: The total rate is transmitted only during T slots. Each entry of 11 (H) is a linear 
combination of symbols carved from ^4d(SNR) where |_4d(SNR)| = SNR"* . In this case, X e>t j = 6dS*d- 



In this case, the total rate NR is transmitted during only T slots over each sub-channel. 
It can be easily verified that for this scheme the outage event is such that, 

d (r, SNR) = (Ji(x,y|H) < Nr log SNR} , 

where, 

SNR 

/i(x,y|H) =logdet (l N + WH ] ). 

n t 
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Each block Tj(E) contains TNn t symbols carved from a signal constellation ^(SNR). 
In order to maintain a rate of i?(SNR) over each sub-channel, the size of the constellation 
|^4d(SNR)| should be chosen such that, 

1 



i?(SNR) = rlogSNR 



NT 



log|^(SNR)r tT7V . 



i.e., |^4d(SNR)| = SNR n « . It can easily be verified that for this choice of signal constellation 
size, the NVD parallel criterion in (PT3T) is, 



mm 



IMDD 



> 



2A r i?(SNR)+o(SNR) ' 



X.XeA'p(SNR) . 

Obviously, the sufficient condition in Theorem \T\ is not satisfied in this case. The achievable 
DMT by this transmission scheme is only p(n t — r)(n r — r) as shown in [|9]|, and it is therefore 
sub-optimal. 

2) Split NVD parallel code: The second way we propose to split the data that guarantees 
to transmit a rate of i?(SNR) using a total power of SNR over each sub-channel is shown 
in Figure [3j In this case, the total rate is split equally among all the NT slots. Each block 
Sj transmits TNn t symbols carved from a signal constellation A, (SNR). The same TNn t 
symbols are transmitted over blocks 3j ... Tjv-i(Ej) but encoded differently. However, 
different symbols are transmitted over two different blocks Ej and Sj. 



TNn t symb 



NT slots 



R 



R. 



R 



n 



/N 



n = 1 



TN-iiai. 



n = N — 1 



Fig. 3. Coding across time and frequency: The total rate is split across the NT slots. Each entry of rj(Si) is a linear 
combination of symbols carved from ^4 S (SNR) where |.4 S (SNR)| = SNR""' . In this case, X e , s = 9 S H S . 



For this transmission scheme, the outage event occurs when at least one of the NVD 
parallel code scheme with rate i?(SNR) = r log SNR is in outage, meaning that, 



AT-l 



(9 2 (r,SNR)= |J a(r,SNR), 



s=0 
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where, 

a(r,SNR) = |l/ 2 (x,y|H) < rlogSNRj , Va, 

and, 

SNR 

J 2 (x, y |H) = log det ( 1 N . 

Note that the normalization factor 1/N in the first side of the inequality in the outage event 
O a (r, SNR) traduces the fact that N blocks are needed to decode the information of each 
NVD parallel code with rate i2(SNR). 

Using the union bound and the inclusion bound (O s C 2 ), the outage probability can be 
bounded as, 

N-l 

P(O s )<F(0 2 )<J2HOs) (14) 

i=0 

Assuming that P(C S ) scales as SNR~ ds(r) , it follows from (O that at high SNR, 

P(C 2 ) = SNR" dsW = P(O a ) = P(Ci), 

This implies that this scheme is equivalent in term of outage to the first scheme. 

In order to maintain the rate of i?(SNR) over each sub-channel, the signal constellation 
Aj(SNR) should be chosen such that, 

R(SNR) =rlogSNR= - log |A(SNR)| ni ™. 

The size of the signal constellation for the split NVD parallel scheme is therefore reduced 
compared to the block diagonal case, and 

| A (SNR) | = SNR^F = |^(SNR)|w. 

Due to the block diagonal channel matrix structure, it can be deduced that the split NVD 
parallel code is equivalent to a concatenation of N independent parallel NVD codes, where 
the symbols of each NVD parallel code are carved from a constellation ^4 S (SNR) with size 

r 

SNR^"* . The system is in error if at least one of the NVD parallel codes is in error, i.e., 

e(r,SNR) = (J SNR), 

i=0 

where e(r, SNR) represents the event that the system is in error and £j(r, SNR) denotes the 
event that the i th NVD parallel code formed by the blocks Hj . . . T7v_i(Sj) is in error. For 
each NVD parallel code with symbols carved from A, (SNR), it can be easily verified by 
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replacing the cardinality of ^4 S (SNR) in (fT3l) that the NVD parallel criterion in Theorem Q] 
is satisfied, i.e., 



X,X e ^(SNR) f = \ 1 ^ - 2 ^(SNR)+o(SNR) 



It follows from Theorem Q] that, 

Pfo) = SNR"* (r) , 
where dj(r) = (pM — r)(m — r), VI 

Using the inclusion and the union bound as for the outage analysis in (PT4l) . it follows that, 

P e (r, SNR) = P(e) = SNR" d(r) , 
with d(r) = di(r) = (pM — r)(m — r). 

The split NVD parallel codes in Figure [3] achieve therefore the optimal DMT of (pM — 

r)(m — r). 

VI. Numerical results 

In order to compare the performance of the split NVD parallel code with the classical 
NVD parallel code, we consider the case of 2 parallel 2x2 MIMO channel, i.e. a block 
fading channel with a total number of blocks equal to 2. 

The structure of the NVD parallel code for this configuration is given in [[HI, such that 

X= ^ ° (15) 
V r(S) J 

where S is given in ([161) with 9 = 9 = a — 1 + i — iO, a = 1 + % - %9 and 

( 8 = e 1 ^. The channel matrix r(S can be deduced from S by replacing ( 8 by — Cs- 



1 / a(si + s 2 Cs + s 3 9 + S4( 8 9) a(s 5 + s 6 ( 8 + s 7 9 + s 8 C 8 
I ( 8 a(s 5 + s 6 ( 8 + s 7 9 + s 8 ( 8 9) a(s a + s 2 ( 8 + s 3 9 + s 4 Cs 



(16) 



For the same channel model, the structure of the split NVD parallel code is such that, 

1 

7! 



X=4 ^ * 2 I (17) 
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d(r) 




1 2 



Fig. 4. The optimal DMT achievable by the NVD parallel code for the 2x2 block fading channel with N = 2 is 
d(r) — 2(2 — r)(2 — r). The split code achieves the optimal DMT of the block fading channel d(r) = (4 — r)(2 — r). 



As we showed in previous section, the optimal DMT achievable by the NVD parallel is 
only 2(2 — r) (2 — r). However, the optimal DMT achievable by the split code is (4 — r)(2 — r). 
These two DMT are depicted in Figure @] 

For a rate per channel use equal to 4 bpcu (resp. 8 bpcu), the symbols s\, S2, ■ ■ ■ , s% should 
be carved from a BPSK (resp. QPSK) constellation for the scheme with split code and from 
a QPSK (resp. 16QAM) constellation for the scheme with NVD parallel code. One should 
expect here that the gain provided by the use of a smaller size of constellation used in the 
split NVD parallel code to be compensated by the normalization factor l/y/2. Due to the 
gain in DMT, this is not the case and the comparison of both schemes is in Figure [5J It 
can be easily shown there the gain of the split codes compared to the NVD parallel case is 
significant when the spectral efficiency of the code increases. For a small rate of 4 bpcu, a 
small gain can be observed. However, for the rate of 8 bpcu, approximately 5 dB of gain 
can be observed. 
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Error Probability of split code vs. NVD parallel code 




10' 



— B — Split code BPSK - R = 4bpcu 

NVD parallel code QPSK - R = 4bpcu 
- - *- - - Split code QPSK -R = 8bpcu 

v NVD parallel code 16QAM - R = 8bpcu 

_J i i i i | i i i i |_ 



_j i i i_ 



-I I I L_ 



5 10 
SNR(dB) 



15 



20 



Fig. 5. Comparison of split NVD code versus NVD parallel code for a block fading MIMO channel with N = 2 blocks 
and n t = n r — 2. 



VII. Conclusion 

In this paper, we considered the class of selective fading MEMO channel where the channel 
is selective either in time and in frequency. Motivated by the open literature debate on 
the optimal achievable DMT for the block fading channel and using completely different 
arguments than [[51 and JH, we proved here that the optimal DMT expression in §5§ is 
achievable for all the classes of selective fading channels, including the block fading channel. 
Using the geometrical argument, we showed that the outage bound in [Q]| is not limiting for 
the outage probability as claimed in [0. Moreover, a new family of split NVD parallel codes 
to achieve the optimal DMT in [[51 for the case of time or frequency selective channels is 
proposed. 
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Appendix A 
Proof of Theorem Q] 

Let X be the transmitted codeword, X the nearest decoded codeword and AX n = X n — X n 
the difference codeword matrix. The pairwise error probability of the correlated parallel 
channels is upper-bounded as following, 

/ SNR N ~ l \ 
PEP < E H expf-^— ^||H n AX n |||J , 

< E H exp(-^Tr(H w 0Hy), (18) 
where H w denotes the n r x Nn t i.i.d. CA/ r (0, 1) matrix, and 

= (Rjf <g> l nt ) diag { AX n AXt r _1 (R; /2 <g> I nt ) 
is the effective codeword matrix. 

Assuming that Af p (SNR) satisfies the NVD criteria, then D = diag {AX n }^_ Q 1 is a full rank 
matrix with rank equals to Nn t . The rank and the eigenvalues of the effective codeword 
matrix & can be computed using the following lemma |2] 

Lemma 2: Let A be a p x p Hermitian matrix given by, 

A = B(CC t )B t , 

where B is p x p matrix with rank s, C is full rank p x p matrix. Then, the matrix A has 
the following properties: 

a) The rank of A is equal to s, the rank of B. 

b) The non zero eigenvalues \k(A) of A are lower bounded by, 

A fc (A) > A 1 (BBt)A fc (CC t ). (19) 

Proof: The proof of this lemma uses the same matricial tools as flU, and is detailed in 
Appendix [Bj ■ 
By applying Lemma Era to ©, it follows that, 

rank{0} = rank{Rj|/ 2 (g) l nt } 

= rank{RjJ/ 2 }rank{I nt } = pn t . 

By noticing that & is not full rank, the Frobenius norm in (fT8l has the same distribution 
as TrjH^AH^,} where is the n r x pn t effective channel with i.i.d. entries ~ CAf(0, 1) 
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and A is the pn t x pn t diagonal matrix containing the non-zero eigenvalues of the effective 
codeword bounded using Lemma [2]-b such that 

A»(0) > a® A i (DD t ), i = \...pn u 

where o\ is the smallest eigenvalue of R H . 

By following the same footsteps as in [(105) and (108) in l4l|l. this Frobenius norm can 
be bounded such that, 

in 

i=i 

m 

8=1 

where H w denotes the m x pM Jensen channel with i.i.d. CA/"(0, 1) entries such that, 

{[H w o . . . H m if n r < n t , 

(20) 
[ H t,o ••• H t, P -iL if > n t . 

The rest of the proof uses the same technique as presented in 0, JH. It can be deduced 

that if the code satisfies the NVD criteria in ©, then the error region event E a (r, SNR) 

for a given channel realisation a matches with the outage region Q^ pM ^ ( r? SNR) of the 

equivalent m x pM MIMO channel, 

k 

E a (r, SNR) = | ^2 a i> k — r, fc = l,...,m}, 

i=l 

= 0^ pM \r, SNR), (21) 
with ck being the vector containing the eigen exponents of the channel 'H.w'W^, such that 

Hn w ni) = SNR~ a \ 

Appendix B 
Proof of Lemma [2] 

As A is an Hermitian matrix, its rank is equal to the rank of BC. It can be easily checked 
from the product matrix rank property in (|2~2~)> . (D G C axb , E G C 6xc ), 

rank{D} + rank{E} - b < rank{DE} 

< min{rank{D},rank{E}}, (22) 
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and the fact that C is a full rank matrix, that, 

rank{B} + p — p < rank{A} < rank{B}, 

which implies that, rank{A} = rank{B}. 

Using the fact that for a square matrix M e C axa , A(MM f ) = A(M t M), implies that 

A fe (A) = A fe (CtBtBC). 

Let B^B = UATJt be the eigenvalue decomposition of B^B, with A = [A P _ S ]. Then, 

A fc (A) = AfcCCtUAUtC), 

= AfcCA^UtCCtUA 1 / 2 ). 
Let ft = U^(CC^)U and fl be the s x s principal submatrix of fi. Then, 

A fc (A) = A^A^OA 1 / 2 ), (23a) 
= A fc (A V2 f2A 1/2 ), (23b) 

As A in (I23bl) is non singular matrix and fi is Hermitian, The Ostrowski theorem in IfTOl 
can be applied, 

A fc (A) > X 1 (A)X k (n), (23c) 
> Ai(BB t )A fc (ri), (23d) 
= A 1 (BB t )A fe (CC t ). (23e) 

As n is a s x s submatrix of the Hermitian matrix S7, (I23dl) follows from the application 
of theorem 4.3.15 in ifTOll . Finally, (I23el) follows from the fact that U is unitary matrix, and 
therefore A fc (fi) = A fc (CC t ). 
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